Predicting innovative firms using web mining and deep learning
نویسندگان
چکیده
Evidence-based STI (science, technology, and innovation) policy making requires accurate indicators of innovation in order to promote economic growth. However, traditional from patents questionnaire-based surveys often lack coverage, granularity as well timeliness may involve high data collection costs, especially when conducted at a large scale. Consequently, they struggle provide makers scientists with the full picture current state system. In this paper, we propose first approach on generating web-based which have potential overcome some shortcomings indicators. Specifically, develop method identify product innovator firms scale very low costs. We use firm-level survey (German Community Innovation Survey) train an artificial neural network classification model labelled (product innovator/no innovator) web texts surveyed firms. Subsequently, apply hundreds thousands Germany predict whether are innovators or not. then compare these predictions patent statistics, extrapolation benchmark data, regional The results show that our produces reliable has be valuable highly cost-efficient addition existing set indicators, due its coverage granularity.
منابع مشابه
integrating web content mining into web usage mining for finding patterns and predicting users’ behaviors
with the increased confidence in the use of the internet and the world wide web, the number of electronic commerce (e-commerce) transactions is growing rapidly. therefore, finding useful patterns and rules of users’ behaviors has become the critical issue for e-commerce and can be used to tailor e-commerce services in order to successfully meet the customers’ needs. this paper proposes an appro...
متن کاملDeep Web Data Mining
World Wide Web (WWW) is broadly divided into two categories: one is Surface web that contains 1% of information content of the web and is crawlable by traditional search engines (like Google, Alta vista etc.) and second is deep web( or Hidden Web) that contains 99% of information content of the web. Most of this information is contained in the databases and is not indexed by search engines. As ...
متن کاملDeep Web Content Mining
The rapid expansion of the web is causing the constant growth of information, leading to several problems such as increased difficulty of extracting potentially useful knowledge. Web content mining confronts this problem gathering explicit information from different web sites for its access and knowledge discovery. Query interfaces of web databases share common building blocks. After extracting...
متن کاملPredicting Process Behaviour using Deep Learning
Predicting business process behaviour is an important aspect of business process management. Motivated by research in natural language processing, this paper describes an application of deep learning with recurrent neural networks to the problem of predicting the next event in a business process. This is both a novel method in process prediction, which has largely relied on explicit process mod...
متن کاملWeb Usage Mining Using Distributed Learning Automata
One of the most important issues in web mining is how to find out similarities between web pages. In this paper we propose a method based on distributed learning automata which take advantage of usage data to find out web pages similarities. The idea of the proposed method is that if different users request a couple of pages consistently together, then these pages are likely to correspond to th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: PLOS ONE
سال: 2021
ISSN: ['1932-6203']
DOI: https://doi.org/10.1371/journal.pone.0249071